Cohesive Links with Literal and Idiomatic Expressions in Discourse: An Empirical and Computational Study
Abstract
Lexical cohesion is an important device for signalling text organisation. In this paper, we investigate to what extent idiomatic expressions participate in the cohesive structure of a text. We look at the problem from both an empirical and a computational perspective. Our results show that both literal and nonliteral expressions exhibit cohesion with their context, though the latter tend to do so to a lesser extent. We also show that cohesive links identified by humans can be approximated by an automatically computable semantic relatedness measure based on search engine page counts.
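The abstract does not say which page-count measure is used. As an illustration only, the sketch below computes one well-known measure of this kind, the Normalized Google Distance, from hit counts supplied by hand. The function name `normalized_distance`, its parameters, and all counts are hypothetical; no search engine is queried, and this should not be read as the paper's actual implementation.

```python
import math


def normalized_distance(fx: float, fy: float, fxy: float, n: float) -> float:
    """Normalized Google Distance from page counts (illustrative sketch).

    fx, fy -- page counts for each term alone (hypothetical inputs)
    fxy    -- page count for both terms together
    n      -- assumed total number of indexed pages
    Lower values mean the two terms are more closely related.
    """
    if fx <= 0 or fy <= 0 or fxy <= 0:
        # Terms that never occur (or never co-occur) are treated as unrelated.
        return math.inf
    lx, ly, lxy = math.log(fx), math.log(fy), math.log(fxy)
    return (max(lx, ly) - lxy) / (math.log(n) - min(lx, ly))


# Made-up counts: a frequently co-occurring pair vs. a rarely co-occurring one.
close_pair = normalized_distance(1000, 1000, 500, 10**6)
distant_pair = normalized_distance(1000, 1000, 10, 10**6)
```

With counts like these, the pair that co-occurs more often receives the smaller distance, which is the property a cohesion measure would rely on when scoring links between an expression's component words and its context.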
Similar resources
Unsupervised Recognition of Literal and Non-Literal Use of Idiomatic Expressions
We propose an unsupervised method for distinguishing literal and non-literal usages of idiomatic expressions. Our method determines how well a literal interpretation is linked to the overall cohesive structure of the discourse. If strong links can be found, the expression is classified as literal, otherwise as idiomatic. We show that this method can help to tell apart literal and non-literal us...
A Cohesion-based Approach for Unsupervised Recognition of Literal and Nonliteral Use of Multiword Expression
Texts frequently contain expressions whose meaning is not strictly literal, such as idioms. Idiomatic and non-literal expressions pose a major challenge to natural language processing technology as they often exhibit lexical and syntactic idiosyncrasies. We propose a novel unsupervised method for distinguishing literal and non-literal usages of expressions. Our method determines how well a liter...
(Un)Translatability of Persian Idiomatic Expressions to English in Political Discourse
The present study sought to investigate the extent to which Persian idiomatic expressions would influence the western translators' strategies in providing the ultimate product in English, and it also attempted to uncover the underlying assumptions in target text, then to suggest some weighty strategies to overcome difficulties with translation. For this purpose, the data was analyzed within the...
Detecting and Processing Figurative Language in Discourse
Figurative language poses a serious challenge to NLP systems. The use of idiomatic and metaphoric expressions is not only extremely widespread in natural language; many figurative expressions, in particular idioms, also behave idiosyncratically. These idiosyncrasies are not restricted to a non-compositional meaning but often also extend to syntactic properties, selectional preferences etc. To d...
Aspectual Composition in Idioms
Idiomatic utterances contribute by virtue of their aspectual class to the temporal structure of discourse – hence it is essential that theories of aspect derivation and discourse structure should encompass them. A recent claim has been made (McGinnis 2002) that the aspectual class of idiomatic expressions can be derived compositionally, by the same process as in literal language. We examine thi...